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(54) Title: PROTEIN L AND HYBRID PROTEINS THEREOF 



(57) Abstract 

The invention relates to sequences of protein L which bind to light chains of immunoglobulins. The invention also relates 
to hybrid proteins thereof which are able to bind to both light and heavy chains of immunoglobulin G, in particular protein LG. 
The invention also relates to DNA-sequences which code for the proteins, vectors which include such DNA-sequences, host cells 
which have been transformed with the vectors, methods for producing the proteins, reagent appliances for separation and ident- 
ification of immunoglobulins, compositions and pharmaceutical compositions which contain the proteins. 
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Protein L and Hybrid Proteins Thereof 

The present invention relates to sequences of protein L 
which bind to light chains of immunoglobulins. The 
5 invention also relates to hybrid proteins of protein L 

having the ability to bind to light chains of all Ig and 
also to bind to light and heavy chains of immunoglobulin 
G, DNA-sequences which code for the proteins vectors 
that contain such DNA-sequences, host cells transformed 
10 by the vectors, methods for preparing the proteins, 

reagent apparatus for separating and identifying immuno- 
globulins, compositions and pharmaceutical compositions 
which contain the proteins. 

15 The invention relates in particular to the DNA-sequence 

and to the amino acid sequence of the light-chain form- 
ing domains of protein L. 

Proteins which bind to the constant domains (of high 
20 affinity) of the immunoglobulins (Ig) are known. Thus, 

protein A (from Staphylococcus aureus ) (Forsgren, A. and 
Sjoquist, J. (1966) Protein A from Staphylococcus 
aureus* I. Pseudo— immune reaction with human gamma- 
globulin. J. Immunol. 97: 822-827) binds to IgG from 
25 various mammal species. The binding of protein A to IgG 

is mediated essentially via surfaces in the Fc-fragment 
of the heavy chain of the IgG-molecule, although a 
certain bond is also effected with surfaces in the Fab- 
fragment of the IgG. Protein A lacks the ability of 
3 0 binding to human IgG3 and neither will it bind to IgG 

from several other animal species, such as important 
laboratory animals, for instance rats and goats, which 
limits the use of protein A. 

35 Protein G (Bjorck, L. and Kronvall, G. (1984) Purifica- 

tion and some properties of streptococcal protein G, a 



SUBSTITUTE SHEET 



WO 93/22342 



PCT/SE93/00375 



2 

novel IgG-binding reagent. J. Immunol. 13 3: 969-974; 
R is, K., Ayoub, E. and Boyle, M. (1984) Streptococcal 
Fc receptors. I. Isolation and partial characterization 
of the receptor from a group C streptococcus . J . 
5 Immunol. 132: 3091-3097) binds to heavy chains in human 

IgG and to all four of its subclasses and also to IgG 
from most mammals, including rats and goats. 

Protein H (Akesson, P., Cooney, J., Kishimoto, F. and 
10 Bjorck, L. (1990) Protein H - a novel IgG binding bacte- 

rial protein. Molec. Immun. 27: 523-531) binds to the 
Fc-fragment in IgG from human beings, monkeys and rab- 
bits. However, the bond is weaker than in the case of 
protein G and A, which may be beneficial when wishing to 
15 break the bond with a weak agent, for instance when 

purifying proteins which are readily denatured with the 
aid of antibodies* 

Protein M (Applicant's Patent Application PCT/SE 
20 91100447) binds to the Fc-fragment in IgG from humans, 

monkeys, rabbits, goats, mice and pigs. 

Protein L (Bjorck, L. (1988) Protein L, a novel bacteri- 
al cell wall protein with affinity to Ig L chains. J. 
25 Immunol. 140: 1194-1197) , which binds to the light 

chains in immunoglobulins from all of the classes G, A, 
M, D and E is known (USP 4,876,194). The amino acid se- 
quence and the binding domains of this protein, however, 
have hitherto been unknown. 

30 

The aforesaid proteins can be used in the analysis, 
purification and preparation of antibodies and for 
diagnostic and biological research. 

35 The elimination of immunoglobulins, with the aid of 
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plasmaph resis, can have a favourable effect on some 
autoimmune diseases. A broadly binding protein would be 
an advantage wh n wishing to eliminate all classes of 
antibodies in this context. 

5 

It has long been known that infectious conditions can be 
prevented or cured with the introduction of an immune 
serum , i.e. a serum which is rich in antibodies against 
the organism concerned or its potentially harmful pro- 

10 duct. Examples hereof are epidemic jaundice, tetanus, 

diphtheria, rabies and generalized shingles. Antibodies 
against a toxic product may also be effective in the 
case of non- infectious occasioned conditions. Serum 
produced in animals against different snake venoms is 

15 the most common application in this respect. However, 

the administration of sera or antibody preparations is 
not totally without risk. Serious immunological reac- 
tions can occur in some cases. Singular cases of the 
transmission of contagious diseases, such as HIV and 

20 hepatitis through the agency of these products have also 

been described. In order to avoid these secondary ef- 
fects, it has been desirable to produce therapeutic 
antibodies in test tubes. A large number of novel tech- 
niques for the preparation of antibodies in test tubes 

25 have been proposed in recent years. Examples of such 

techniques are hybridom techniques, synthesis of chima- 
antibodies and the preparation of antibodies in bacte- 
ria. These techniques also enable antibodies to be 
specially designed which can further widen the use of 

3 0 such molecules as therapeutics, for instance in the case 

of certain tumour-diseases. In the case of some of these 
novel methods, however, the product totally lacks the 
Fc-fragment to which all of the described IgG-binding 
proteins, with the exception of protein L, bind. There 

3 5 is consequently a need of a process for purifying anti- 
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bodies for therapeutic use, wherein proteins which have 
a broad binding activity/ specificity , can be of value. 



It has long been possible to utilize the antibody reac- 
5 tion with its high grade specificity for diagnosing past 

or, in some cases, ongoing infections with different 
parasites. This indirect method of indicating infectious 
agents is called serology and, in many cases, may be the 
only diagnostic alternative. In certain cases, it can 

10 also be of interest to exhibit specific IgE- or IgA- 

antibodies. When diagnosing with the aid of serology, 
the antigen is most often fastened to a solid phase, 
whereafter serum taken from the patient is incubated 
with the antigen. Antibodies that have been bound from 

15 the patient can then be detected in different ways, 

often with the aid of a secondary antibody (for in- 
stance, an antibody which is directed against the light 
chains of human antibodies) to which an identifiable 
label has been attached, such as alkaline phosphatase, 

20 biotin, radioactive isotopes, fluorescein, etc. In this 

context, a protein having a broad Ig binding capacity 
can be used as an alternative to secondary antibodies. 

There are a number of non-therapeutic and non-diagnostic 
25 reasons for the necessity to bind antibodies. Antibodies 

are often used in research, both for detection and for 
purifying the antigen against which they are directed. 
All techniques which facilitate the purification of 
antibodies and, in particular, techniques which enable 
30 different classes to be purified, are of interest in 

this context . 

Consequently, there is a serious need of a protein which 
has a broad binding activity/ specificity and which binds 
3 5 to several different classes of immunoglobulins from 

different animal species. At present, there is no known 
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protein which will bind to all immunoglobulin classes. 
The earlier known proteins A, G, H and M bind only to 
heavy chains in IgG. The known protein L (Bjorck et al, 
1988) binds to the light x" chains and 7-chains in im- 
5 munoglobulins of all classes, although the bonds are 

much weaker on the *-chains. Applicant has charted pro- 
tein L, has determined the amino acid sequence for 
protein L, has identified the light-chain binding do- 
mains on protein L, and has used these to produce hybrid 

10 proteins which possess the IgG-Fc-binding domains of 

protein G. The Applicant is able to show through protein 
LG that a protein of broader binding activity/ 
specificity can be produced thereby. The aforesaid 
proteins A, G, H and M bind to the same surfaces, or to 

15 very closely lying surfaces on IgG-Fc. The protein L 

which binds to light chains can thus be combined with 
any other functionally similar protein which binds to 
the Fc-fragment of heavy chains. A similar broadening of 
the Ig-binding activity is achieved with all alterna- 

20 tives. 



Thus, the present invention relates to the sequence of 
protein L which binds to light chains in Ig and has the 
amino acid sequence disclosed in Figure 1, and variants, 
25 subf ragments r multiples or mixtures of the domains B1-B5 

having the same binding properties. The invention also 
relates to a DNA-sequence which codes for such protein 
sequences, for instance the DNA-sequence in Figure 1. 

30 The invention is concerned with a hybrid protein which 

is characterized by comprising domains which bind to the 
light x-chains and X-chains in immunoglobulins of all 
classes, and also comprises domains which bind to heavy 
chains in immunoglobulin G, wherein those domains which 

3 5 bind to the light chains are chosen from among the B1-, 

B2-, B3-, B4- and B5-domains in protein L (see Claim 1) 
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and those domains which bind to heavy chains of immuno- 
globulins are chosen from the C1-, C2- and C3 -domains in 
protein G; the A-, B- and Cl-domains from protein H; the 
A-, B1-, B2- and S-domains in protein Ml or the E-, D-, 
5 A- , B— and C-domains in protein A (see Figure 6) and 

variants, subf ragments , multiples or mixtures of these 
domains that have the same binding properties which bind 
to heavy chains of immunoglobulins, 

10 By subfragment is meant a part-fragment of the given 

domains or fragments which include parts from the vari- 
ous domains having mutually the same binding properties. 
By variants is meant proteins or peptides in which the 
original amino acid sequence has been modified or 

15 changed by insertion, addition, substitution, inversion 

or exclusion of one or more amino acids, although while 
retaining or improving the binding properties. The 
invention also relates to those proteins which contain 
several arrays (multiples) of the binding domains or 

20 mixtures of the binding domains with retained binding 

properties. The invention also relates to mixtures of 
the various domains of amino acid sequences having 
mutually the same binding properties. 

25 The invention relates in particular to a hybrid protein 

designated LG, and is characterized in that the hybrid 
protein includes the B-domains in protein L which bind 
to the light chains in immunoglobulins, and the Cl- 
domains and C2 -domains in protein G which bind to heavy 

3 0 chains and have the amino acid sequence disclosed in 

Figure 3. The invention also relates to variants, sub- 
fragments, multiples or mixtures of these domains. 

Protein LG is a hybrid protein having a molecular weight 
35 of about 50 kDa (432 amino acids) and comprising four 

domains, each of which binds to light chains in immuno- 
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globulins, and two IgG-binding domains from protein G. 
The hybrid protein combines a broad IgG-binding activi- 
ty, deriving from the high-grade binding ability of 
protein G to the Fc-fragment of the heavy chain on IgG 
5 with the ability of the protein L to bind to light 

chains of all classes of immunoglobulins. Thus, protein 
LG binds polyclonal human IgG, IgM, IgA, IgD and^IgE. 
The affinity for human polyclonal IgG is 2 x 10 M 
All four human immunoglobulin classes are bound. Binding 

10 to human IgG is effected with both the x-and the X- 

chain. Both the Fc-fragment and the Fab-fragment of IgG 
are bound to the hybrid protein. The protein also binds 
human IgA-, IgD-, IgE- and IgM-antibodies . The bond is 
stronger to human immunoglobulins which carry x than to 

15 those which carry the X-isotope of light chains. IgG 

from most mammals will be bound by protein LG, thus also 
IgG from goats and cows, which do not bind to protein L. 
However, rabbit-IgG which binds relatively weakly to 
protein L will bind well to the fusion protein. IgM and 

20 IgA-antibodies from mice, rats and rabbits will be bound 

to the protein. 



Protein LG is highly soluble. It is able to withstand 
heat and will retain its binding properties even at high 

25 temperatures. The binding properties also remain in a 

broad pH-range of 3-10. The protein withstands detergent 
and binds marked or labelled proteins subsequent to 
separation in SDS-PAGE and transference to membranes 
with elektroblotting. The protein can be immobilized on 

30 a solid phase (nitrocellulose, Immobilon®, polyacryl- 

amide, plastic, metal and paper) without losing its 
binding capacity. The binding properties are not influ- 
enced by marking with radioactive substances, biotin or 
alkaline phosphatase. (The binding abilities of the 
v 35 protein LG are disclosed in Example 3) . 
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The protein comprises 43 2 amino acids and has a molecu- 
lar weight of 50 kDa deriving therefrom. The sequence is 
constructed of an ala sequence of the three last amino 
acids in the A -domain of the protein L (val-glu-asn) , 
this ala sequence being unrelated to the two proteins, 
whereafter the four mutually high-grade homologous B- 
domains from protein L follow. The first of the B-do- 
mains is comprised of 76 amino acids, and the remaining 
domains are each comprised of 72 amino acids. The first 
nine amino acids from the fifth B-domain are included 
and followed by two non-related amino acids (pro-met) . 
The protein G-sequences then follow. The last amino acid 
in the so-called S-domain from protein G is followed by 
an IgG-binding domain from protein G (CI; 55 amino 
acids) , the intermediate D-region (15 amino acids) and 
the second IgG-binding C-domain (C2; 55 amino 
acids) . The last amino acid is a methionine, which 
occurs in natural protein G as the first amino acid in 
the so-called W-region. 

The invention also relates to DNA-sequences which code 
for the aforesaid proteins. 

The gene which codes for the IgG-binding amino acid 
sequences can be isolated from the chromosomal DNA from 
Staphylococcus aureus based on the information on the 
DNA-sequence for protein A (S. Lofdahl, B. Guss, M. 
Uhlen, L. Philipsson and M. Lindberg. 1983. Gene for 
staphylococcal protein A. Proc. Natl. Acad. Sci. USA. 
80: 697-701) and Figure 6, or from G-streptococcus , 
preferably strain G 148 or C-streptococcus , preferably 
strain Streptococcus equisimilis C 40, based on the 
information on protein G (B. Guss, M. Eliasson, 
A. Olsson, M. Uhlen, A.-K. Fre j , H. Jorvall, I. Flock 
and M. Lindberg. 1986. Structure of the IgG-binding 
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regions of streptococcal protein G. EMBO. J. 5: 1567- 
1575) and Figure 6, or from group A-streptococcus, 
e.g. S . pyogenes (type Ml) based on the information on 
the DNA-sequence for protein H (H. Gomi, T. Hozumi, 
5 S. Hattori, C. Tagawa, F. Kishimoto and L. Bjorck. 1990. 

The gene sequence and some properties of protein H - a 
novel IgG binding protein J. Immunol. 144: 404 6-4052) 
and Figure 6, or from the chromosomal DNA in group A- 
streptococcus type Ml based on the information on the 

10 DNA-sequence for protein M (Applicant's Patent Applica- 

tion, PCT/SE 91100447) and Figures 6 and 7. The gene 
which codes for the protein that binds to light chains 
can be isolated from the chromosomal DNA from Pepto-co- 
ccus maanus 312 based on the information on the DNA— 

15 sequence for protein L in Claim 2. 

By using the chromosomal DNA 't obtained from the afore- 
said bacteria as a template, a DNA-fragment defined with 
the aid of two synthetic oligonucleotides can then be 

20 specifically amplified with the aid of PCR (Polymerase 

Chain Reaction) . This method also enables recognition 
sites to be incorporated for restriction enzymes in the 
ends of the amplified fragments (PCR technology, Ed: PCR 
Technology. Principles and Applications for DNA Amplifi- 

25 cation. Ed. Henry Erlich. Stockton Press, New York, 

1989) . The choice of recognition sequences can be adapt- 
ed in accordance with the vector chosen to express the 
fragment or the DNA-fragment or other DNA- fragments with 
which the amplified fragment is intended to be combined. 

30 The amplified fragment is then cleaved with the restric- 

tion enzyme or enzymes concerned and is combined with 
the fragment/the other fragments concerned and the 
fragments are then cloned together in the chosen vector 
(in this case, the expression vector) (Sambrook, J.E. 

35 Fritsch and T. Maniatis, 1989, Molecular cloning: A 

laboratory manual, 2nd Ed. Cold Spring Harbor Laborato- 
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ries, Cold Spring Harbor, New York, USA) . The plasmid 
vector pHD313 can be used (Dalboge, H.E. Bech Jensen, 
H. Tottrup, A. Grubb, M. Abrahamson, I. Olafsson and S. 
Carlsen, 1989. High-level expression of active human 
5 cystatin C in Escherichia coli. Gene, 79: 325-332), 

alternatively one of the vectors in the so-called PET- 
series (PET 20, 21, 22, 23) retailed by Novagen (Madi- 
son, Wisconsin, USA) . 

10 The hybrid proteins are then incorporated in an appro- 

priate host, preferably E. coli. The invention also 
relates to such hosts as those in which the hybrid 
proteins are incorporated. 

15 Those clones which produce the desired proteins can be 

selected from the resultant transf ormants with the aid 
of a known method (Fahnestock et al. , J. Bacteriol. 167, 
870 (1986). 

When the proteins that can bind to the light chains in 
the immunoglobulins and to the heavy chains in IgG have 
been purified from the resultant positive clones with 
the aid of conventional methods, the binding specifici- 
ties of the proteins are determined for selection of 
those clones which produce a protein that will bind to 
the light chains in immunoglobulins and to the heavy 
chains in IgG. 

Subsequent to having isolated plasmid DNA't in said 
3 0 clone with conventional methods, the DNA-sequence in the 

inserted material is determined with known methods 
(Sanger et al., Proc. Natl. Acad. Sci. USA 74, 5463 
(1977) . 

3 5 The invention also relates to DNA-sequences which hy- 

bridize with said identified DNA-sequences under conven- 



20 



25 
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tional conditions and which code for a protein that 
possesses the desired binding properties. Strict hybrid- 
izing conditions are preferred. 

5 Expression of the genes can be effected with expression 

vectors which have the requisite expression control 
regions, the structural gene being introduced after said 
regions. As illustrated in Figure 1 and Claim 2, the 
structural gene can be used for protein LG or other 
10 hybrid proteins with protein L. 

With regard to expression vectors, different host-vec- 
tor-systems have been developed, of which the most 
suitable host-vector-systems can be selected for expres- 
15 sion of the genes according to the present invention. 

The present invention also relates to a method of pro- 
ducing the inventive hybrid proteins by cultivating a 
host cell which is transformed with an expression vector 
20 in which DNA't which codes for the proteins according to 

the invention is inserted. 

This method includes the steps of 

25 (1) inserting into a vector a DNA-fragment which codes 

for the hybrid proteins; 

(2) transforming the resultant vector into an appropri- 
ate host cell; 

30 

(3) cultivating the resultant, transformed cell for 
preparation of the desired hybrid protein; and 

(4) extracting the protein from the culture. 

35 
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In the first step, the DNA- fragment which codes for the 
hybrid protein is insert d in a vector which is suitable 
for the host that is to be used to express the hybrid 
protein. The gene can be inserted by cleaving the vector 
5 with an appropriate restriction enzyme, and then legat- 

ing the gene with the vector. 

In the second step, the vector with the hybrid plasmid 
is inserted into host cells. The host cells may be 
10 Escherichia, coli. Bacillus subtilis or Saccharomvces 

cerevisiae or other suitable cells. Transformation of 
the expressions hybrid vector into the host cell can be 
effected in a conventional manner and clones which have 
been transformed can then be selected. 



15 



20 



In the third step, the obtained transf ormants are culti- 
vated in an appropriate medium for preparation of the 
desired proteins by expression of the gene coded for the 
hybrid protein. 



In the fourth step, the desired protein is extracted 
from the culture and then purified. This can be achieved 
with the aid of known methods. For instance, the cells 
can be lysed with the aid of known methods, by treating 

2 5 the cells with ultrasonic sound, enzymes or by mechani- 

cal degradation. The protein which is released from the 
cells or which excretes in the medium can be recovered 
and purified with the aid of conventional methods often 
applied within the biochemical field, such as ion-ex- 

30 change chromatography, gel filtration, affinity chroma- 

tography with the use of immunoglobulins as ligands, 
hydrophobic chromatography or reverse-phase chromato- 
graphy. These methods can be applied individually or in 
suitable combinations. 

35 
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As before mentioned, the inventive proteins may be used 
for binding, identifying or purifying immunoglobulins. 
Th y can also be bound to pharmaceuticals and used in 
formulations which have delayed release properties. To 
5 this end, the protein may be present in a reagent appli- 

ance for pharmaceutical composition in combination with 
appropriate reagents, additives or carriers. 

The proteins can be handled in a freeze-dried state or 
10 in a PBS-solution (phosphate-buffered physiological salt 

solution) pH 7.2 with 0.02% NaN^ . It can also be used 
connected to a solid phase, such as carbohydrate-based 
phases, for instance CNBr-activated sepharose, agarose, 
plastic surfaces, polyacrylamide, nylon, paper, magnetic 
15 spheres, filter, films. The proteins may be marked with 

biotin, alkaline phosphatase, radioactive isotopes, 
fluorescein and other fluorescent substances, gold 
particles, ferritin, and substances which enable lumi- 
nescence to be measured. 

20 

Other proteins may also be used as carriers. These 
carriers may be bound to or incorporated in the pro- 
teins, in accordance with the invention. For instance, 
it is conceivable to consider the whole of proteins A, 
25 G, H, M as carriers for inserted sequences of protein L 

which bind to light chains. In turn, these carriers can 
be bound to the aforesaid carriers. 

The pharmaceutical additions that can be used are those 
30 which are normally used within this field, such as 

pharmaceutical qualities of mannitol, lactose, starch, 
magnesium stearate, sodium saccharate, talcum, cellu- 
lose, glycose, gelatine, saccharose, magnesium carbonate 
and similar extenders, such as lactose, dicalcium phos- 
35 phate and the like; bursting substances, such as starch 

or derivatives thereof; lubricants such as magnesium 
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stearate and the like; binders, such as starch, gum 
aribicum, polyvinylpyrrolidone, gelatine, cellulose and 
derivatives thereof, and the like. 

5 The invention will now be described in more detail with 

reference to the accompany drawings, in which 

Figure 1 illustrates the plasmid pHD389; the ribosomal 
binding sequence, the sequence for the signal peptide 
10 from ompA and recognition sequence for several restric- 

tion enzymes are shown; 

Figure 2 illustrates the amino acid and nucleic acid 
sequence for protein LG; 

15 

Figure 3 is a schematic overall view of the production 
of protein L; 

Figure 4 is a schematic overall view of the production 
20 of protein LG; 

Figures 5a, 5b and 5c are schematic overall views of the 
production of the hybrid proteins LA, LM and LH respec- 
tively; 

25 

Figure 6 is a schematic inclusive illustration of pro- 
tein A, G, H and Ml. IgGFc-binding domains are for 
protein A: E, D, A, B and C; for protein G: CI, C2 and 
C3; for protein H: A and/or B; and for protein Ml: A, 
30 Bl, B2, B3 and S; 

Figure 7 illustrates the amino acid and nucleic acid se- 
quence for protein Ml; 
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Figure 8 illustrates Western Blot for protein G, L and 
LG with certain immunoglobulins and immunoglubulin 
fragments; and 

5 Figure 9 illustrates Slot-Blot for protein L, G and LG 

with IgG, Igx and Ig Fc. 

The amino acid and nucleic acid sequence of the light- 
chain binding domains of protein L is illustrated in 
10 Claims 1 and 2 respectively. 

It will be observed that the drawings are not to scale. 
Example 1 

15 

Cloning and expression of the igG-light-chain-binding 
domains in Protein L . 

Construction of synthetic oligonucleotides (primers) for 
20 amplifying sequences coded for protein I*, domain B1-B4 

It has been found that a protein L peptide (expressed in 
E. coli l constructed of the sequence ala-val-glu-asn- 
domain Bl (from protein L) binds to the light chains of 

25 the immunoglobulins (W. Kastern, U. Sjobring and L. 

Bjorck. 1992. Structure of peptostreptococcal protein L 
and identification of a repeated immunoglobulin light 
chain-binding domain. J. Biol. Chem. in-print) . Since 
this simple protein L-domain has a relatively low aff in- 

30 ity to Ig, (1 x 10 7 m" 1 ) , and since the naturally occur- 

ring protein L which is constructed of several mutually 
similar domains (B1-B5) has a high affinity to Ig (1 x 
10 10 m" 1 ) four of these domains have been expressed 
together in the following way: 

35 
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PL-N and PL-C1 are synthetic oligonucleotides (manufac- 
tured by the B i onto 1 ecu lar Unit at Lund University (Swe- 
den) in accordance with Applicant's instructions) which 
have been used to amplify a clonable gene fragment which 
5 is amplified with PCR (Polymerase Chain Reaction) and 

which codes for four Ig-binding protein L domains (ala- 
val-glu-asn-Bl-B2-B3-B4-lys-lys-val-asp-glu-lys-pro-glu- 
glu) • Amino acids in the protein L-sequence are given 
for the primer which corresponds to the coded strand 
10 (PL-N) : 



PL-N : 5 ' — GCTCAGGCGGCGCCGGTAGAAAATAAAGAAGAAACACCAGAAAC— 3 ' 

va lg luasn ly sg lug luthrpr og lu 

15 

5 '-end of this oligonucleotide is homologous with the 
coded strand in the protein L-gene (emphasized) : those 
codons which code for the last three amino acids in the 
A-domain (val-glu-asn) are followed by the codons for 
20 the first six amino acids in the first of the Ig-binding 

domains in protein L (Bl) . 

PL-Cl : 5 ' -CAGCAGCAGGATTC TTATTAT TCTTCTGGTTTTTCGTCAACTTT 
CTT-3 ' 

25 

This oligonucleotide is homologous with the opposing 
non-coding strand in the gene for protein L (the se- 
quence corresponds to the first nine amino acids in 
domain B5) . 

30 

DNA-f ragments which have been amplified with the aid of 
PL-N contain the recognition sequence for the restric- 
tion enzyme HpalX (emphasized) immediately before the 
codon which is considered to code for the first amino 
35 acid (val) in the expressed protein L-fragment. The 

fragment which is cleaved with HpalX can be ligated with 
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DNA (in this case, consisting of the used expression 
vector pHD389) which has be n cleaved with the restric- 
tion enzyme Narl. The DNA-fragment that has been cleaved 
with Hpall and ligated with vector pHD389, which has 
5 been cleaved with Narl, will be translated in the cor- 

rect reading frame. The construction results in trans- 
lation of an additional amino acid (ala) immediately in 
front of the first amino acid in protein L. 

DNA-fragments which have been amplified with the aid of 
PL-C1 will contain the recognition sequence for the 
restriction enzyme BamHI (over lined above the sequence) 
immediately after the sequence which codes for the last 
amino acid in the expressed protein L-fragment (glu) . 
The vector pHD389 contains a unique recognition sequence 
for BamHI as part of its so-called multiple cloning 
sequence which follows the Narl recognition sequence. 
DNA-fragments which have been amplified with the aid of 
PL-C1 will include two so-called stop-codons (empha- 
sized) which results in translation of the fragment 
inserted in the vector to cease. 

The sequence which was considered to be amplified con- 
tains no internal recognition sequences for the restric- 
25 tion enzymes Hpall or BamHI. 

Amplifying and cloning procedures 

(PCR) (Polymerase Chain Reaction) was effected with a 
30 protocol described by Saiki, R.D. Gelfand, S. Stoffel, 

S. Scharf , R. Higuchi, G. Horn, K. Mullis and H. Erlich, 
1988 ; Primer-directed enzymatic amplification of DNA 
with a thermostable DNA polymerase. Science 239: 487- 
49127; PCR was effected in a Hybaid Intelligent Heating- 
35 block (Teddington, UK) : 100 /il of a reaction mixture 

contained 50 mM KC1, 10 mM Tris-HCl, pH 8.3, 1.5 mM 



10 



15 



20 
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MgCl , 100 M/ml gelatine, 3 00 mM with respect to each of 
the deoxynucleotides (dATP, dCTP, dGTP , dTTP) , (Pharma- 
cia) , 2 0 pmol of each of the oligonucleotides PL-N and 
PL-C1, 10 Ml of a target (template) DNA-solution con- 
5 taining 0.1 mg/ml of chromosomal DNA from Pepto- 

streptococccus maanus . species 312. The mixture was 
covered with mineral oil (Sigma) and DNA' t was denatured 
by heating to 98 °C for 10 minutes. 2.5 units of AmpliTaq 
(Perkin Elmer Cetus, Norwalk, CT) were added and PCR was 

10 then carried out with 2 5 cycles consisting of a denatur- 

ing step at 94 °C for 1 minute, followed by a hybridizing 
step at 56°C for 1 minute, and finally by an extension 
step at 72 °C for 1 minute. Amplified DNA was analyzed by 
electrophoresis in agarose gel. The amplified DNA' t was 

15 cleaved with the restriction enzymes HpalX (Promega) , 

(8 units//xg amplified DNA) and BamHI (Promega), (10 
units//ig amplified DNA) at 37 °C. The thus amplified and 
subsequently cleaved DNA-product was isolated by elec- 
trophoresis in a 2% (weight by volume) agarose gel 

20 (NuSieve agarose, FMC Biproducts) in a TAE-buffer (40 Mm 

Tris, 20 Mm Na-acetate, 2 Mm EDTA, Ph 8.0). The result- 
ing 930 base-pair large fragment was cut from the gel. 
The DNA concentration in this removed gel-piece was 
estimated to be 0.05 mg/ml. The agarose-piece containing 

25 the cleaved, amplified fragment was melted in a water 

bath at 65 °C, whereafter the fragment was allowed to 
cool to 37 °C. 10 Ml (0.5 M9) of this DNA was transferred 
to a semimicrotube (Sarstedt) preheated to 37 °C, where- 
after 1 /il of the vector pHD389 was immediately added 

30 and cleaved with Narl (Promega) and BamHI, 1 /il 

lOxligas-buf fer (Promega and 1 Ml T4 DNA-ligase (Pro- 
mega; 1 unit//xl) . The ligating reaction was then used to 
transform E. coli , strain LE392, which had been compe- 
tent in accordance with the rubidium/calcium-chloride- 

35 method as described by Kushner (1978). Molecular biolog- 

ical standard methods have been used in the manipulation 
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of DNA (Sambrook, J.E. Fritsch and T. Maniatis, 1989. 
Molecular cloning: A laboratory manual. 2nd Ed. Cold 
Spring Harbor Laboratories, Cold Spring Harbor, N w 
York, USA) . The cleaving and ligating conditions recom- 
5 mended by the manufacturer of DNA-ligase and restriction 

enzymes have been followed in other respects. 

Expression system 

10 The vector pHD389 (see Figure 2) is a modified variant 

of the plasmid pHD313 (Dalboge, H.E. Bech Jensen, H. 
Tottrup, A. Grubb, M. Abrahamson, I. Olafsson and 
S. Carlsen, 1989. High-level expression of active human 
cystatin C in Escherichia coli. Gene, 79: 325-332). The 

15 vector, which is replicated in E. coli (contains ori = 

origin of replication from plasmid pUC19) is constructed 
so that DNA-fragments which have been cloned into the 
cleaving site of NarZ will be transcribed and translated 
downstream of and in the immediate vicinity of the 

20 signal peptide (21 amino acids) , from envelope-protein 

ompA from E. coli . Translation will be initiated from 
the codon ATG which codes for the first amino acid 
(methionine) in the signal peptide. This construction 
permits the translated peptide to be transported to the 

25 periplasmic space in E. coli . This is advantageous, 

since it reduces the risk of degradation of the desired 
product of enzymes occurring intracellular ly in E. coli . 
Moreover, it is easier to purify peptides which have 
been exported to the periplasic space. Unique recog- 

3 0 nit ion sequences (multiple cloning sequences) for sever- 

al other restriction enzymes, among them ecoRI, Sail and 
BamHI are found immediately after the NarZ cleaving 
site. An optimized so-called Shine-Dalgarno-sequence 
(also called ribosomal binding site, RBS) is found seven 

35 nucleotides upstream from the ATG-codon in the signal 

sequence from ompA, this optimized s quence binding to a 
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complementary sequenc in 16S rRNA in the ribosomes and 
is responsible for the translation being initiated in 
the correct place* The transcription of such DNA as that 
which is co-transcribed with the signal sequence for 
5 ompA is controlled by the P -promotor from coliphage X. 

The vector also contained the gene for CI857 from coli- 
phage X whose product down-regulates transcription from 
P R (and whose product is expressed const i tut ively) . This 
cI8 57 -mediated down-regulation of transcription from P R 

10 is heat-sensitive. The transcription regulated from this 

promotor is terminated with the aid of a so-called rho- 
independent transcription terminating sequence (forms a 
structure in DNA' t which results in the DNA-dependent 
RNA-polymerase leaving the DNA-strand) which is placed 

15 in the vector immediately downstream of the multiple 

cloning sequence. The plasmid also carries the B-lacta- 
mase gene (from the plasmid pUC19) whose product permits 
ampicillin-selection of E. coli clones that have been 
transformed by the vector. 



20 



Selection of protein L-producing clones 



The transformed bacteria are cultivated, or cultured, on 
culture plates with an LB-medium which also contained 

25 ampicillin in a concentration of 100 /xg/ml. Cultivation 

of the bacteria progressed overnight at 3 0°C, whereafter 
the bacteria were transferred to an incubator where they 
were cultivated for a further 4 hours at 42 °C. The 
plates were kept in a refrigerator overnight. On the 

30 next day, the colonies were transferred to nitrocellu- 

lose filters. Filters and culture plates were marked so 
as to enable the transferred colonies to be readily 
identified on respective culture plates. The culture 
plates were again incubated overnight at 30°C, so that 

35 remaining rests of transferred bacteria colonies could 

again grow. The plates were then kept in a refrigerator. 
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The bacteria in the colonies on the nitrocellulose- 
impressions were lysed by incubating the filter in 10% 
SDS for 10 minutes. Filters containing lysed bacteria 
were then rinsed with a blocking buffer which comprised 
5 PBS (pH 7.2) with 0.25% gelatine and 0.25% Tween-20 

(four baths, 250 ml each at 37°C) , whereafter the filter 
was incubated with radioactively marked (marked with 

125 

I in accordance with the chloramin-T-method) Ig-K- 
chains (20 ng/ml in PBS with 0.1% gelatine). The incuba- 

10 tion took place at room temperature over a period of 3 

hours, whereafter non-bound radioactively marked protein 
was rinsed-off with PBS (pH 7.2) containing 
0.5 M NaCl, 0.25% gelatine and 0.25% Tween-20 (four 
baths, 250 ml each at room temperature) . All filters 

15 were exposed to X-ray film. Positive colonies were 

identified on the original culture plate. Clones which 
reacted with Ig-K-chains were selected and analyzed with 
respect to the size on the DNA-fragment introduced in 
the vector. One of these clones was selected for the 

20 production of protein L, pHDL. The DNA't introduced from 

this clone into plasmid pHD389 was sequenced. The DNA- 
sequence was found to be in full agreement with corre- 
sponding sequences (B1-B4 and 21 bases in B5) in the 
gene for protein L from Peptostreptococcus maanus . 

25 strain 312. The size and binding properties of the 

protein produced by clone pHDL was analyzed with the aid 
of SDS-PAGE (see Figure 8) , dot-blot experiment (see 
Figure 9) and competitive binding experiments. 

30 Production of protein L 

Several colonies from a culture plate with E. coli pHDL 
were used to inoculate a preculture (LB-medium with an 
addition of 100 mg/1 ampicillin) , which was cultured at 
35 28 °C overnight. On the following morning, the preculture 

was transferred to a larger volume (100 times the volume 
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of the preculture) of fresh LB-medium containing ampi- 
cillin (100 mg/1) and was cultured in shake-flasks (200 
rpm) , (or fermentors) at 28 °C. The culture temperature 
was raised to 40 °C (induction of transcription) when the 
5 absorbency value at 620 nm reached 0.5. Cultivation then 

continued for 4 hours (applied solely to cultivation in 
shake-flasks) . Upon completion of the cultivation pro- 
cess, the bacteria were centrifuged down. The bacteria 
were then lysed with an osmotic shock method at 4°C 

10 (Dalboge et al., 1989 supra). The lysate was adjusted to 

a pH = 7. Remaining bacteria rests were then centrifuged 
down, whereafter the supernatent was purified on IgG- 
sepharose in accordance with earlier described protocol 
for protein G and protein L (U. Sjobring, L. Bjorck and 

15 W. Kastern. 1991. Streptococcal protein G: Gene struc- 

ture and protein binding properties. J. Biol. Chem. 266: 
399-405; Kastern, U. Sjobring and L. Bjorck. 1992. 
Structure of peptostreptococcal protein L and identifi- 
cation of a repeated immunoglobulin light chain-binding 

20 doman. J. Biol. Chem. in-print. 

The expression system gave about 20 mg/1 of protein L 
when cultivation in shake-flasks. The culture was depos- 
ited at DSSM, Identification Reference DSSM E. coli 
25 LE3 9 2 / pHDL . 
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Example 2 

Cloning and expression of p rotein LG 

5 Construction of oligonucleotides (primers) for amplify- 

ing sequences which code for protein LG 

Protein L 

10 It has been found that a protein L-peptide (expressed in 

E. coli ) constructed of the sequence ala-val-glu-asn- 

domain Bl (from protein L) will bind to the light chains 

of the immunoglobulins (Kastern, Sjobring and Bjorck, 

1992, J. Biol, Chem. in-print). Since the affinity of 

-7 - 

15 this simple domain to Ig is relatively low (1 x 10 M 

X ) and since the naturally occurring protein L, which is 
comprised of several mutually similar domains (B1-B5) 
has a higher affinity to Ig (1 x 10 10 m" 1 ) , four of 
these domains have been expressed together in the fol- 

20 lowing way: 

PL-N and PL-C2 are synthetic oligonucleotides (manufac- 
tured at the Biomolecular Unit at Lund University (Swe- 
den) in accordance with Applicant's instructions) which 
25 were used, with the aid of PCR (Polymerase Chain Reac- 

tion) to amplify a clonable gene fragment, called Bl-4, 
which codes for four Ig-binding protein L domains (ala- 
val-glu-asn-Bl-B2-B3-B4-lys-lys-val-asp-glu-lys-pro-glu- 

glu) : 

30 

PL-N : 5 ' — GCTCAGGCGGCGCCGGTAGAAAATAAAGAAGAAACACCAGAAAC- 3 ' 

va lg luasn ly sg lug luthrpr og lu 

P1-C2 : 5 ' — CAGCAGCAGCCA2GGGTTCTTCTGGTTTTTCGTCAACTTTCTTA- 
35 3' 
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Amino acids have been shown under corresponding triplets 
in the coded strand. DNA- fragments which have been 
amplified with the aid of PL-N contain the r cognition 
sequence for the restriction enzyme Hpall immediately 
5 upstream of the triplet which codes for the first amino 

acid (val) in the expressed protein L-fragment. The 
fragment that has been cleaved with Hpall can be ligated 
with DNA (in this case, the used expression vector 
pHD389) which has been cleaved with NarX. The construc- 

10 tion results in translation of an extra amino acid (ala) 

immediately upstream of the first amino acid in the 
protein L-fragment. The DNA-fragment that has been 
amplified with the aid of PL-C2 will contain the recog- 
nition sequence for the restriction enzyme Ncol (empha- 

15 sized) immediately downstream of the sequence which 

codes for the last amino acid in the expressed protein 
L-fragment (glu) • Amplified fragments which have been 
cleaved with Ncol can be ligated to the Ncol-cleaved, 
PCR-generated protein-asp-CDC-met-f ragment (see below) . 

20 



Protein G 



It is known that a simple C-domain from protein G will 

25 bind to IgG (B. Guss, M. Eliasson, A. Olsson, M. Uhlen, 

A.-K. Frej, H. Jornvall, I. Flock and M. Lindberg. 1986. 

Structure of the IgG-binding regions of streptococcal 

protein G. EMBO. J. 5: 1567-1575). The strength at which 

a simple C-domain binds to IgG is relatively low 

30 (5 x 10 7 m" 1 ) . A fragment which consists of two C-do- 

mains with an intermediate D-region having a length of 

15 amino acids , however, has a considerably higher 

9 -1 

affinity to IgG (1 x 10 M ) . CDC-N and CDC-C are 
oligonucleotides which have been used as PCR-primers to 
35 amplify a clonable DNA-fragment, designated CDC, which 
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codes for two IgG-binding protein G-domains (pro-met 
asp-CDC-met) . 

CDC-N : GGCCST5GACACTTACAAATTAATCCTTAATGGT 
me t a spthr ty r lysleuileleuasngly 

CDC-C : cAflflTgcSA CTTATTA CATTTCAGTTACCGTAAAGGTCTTAGT 



Amino acids in the resultant sequence have been shown 
10 beneath the primer of the coding strand. DNA-f ragments 

which have been amplified with the aid of CDC-N contain 
the recognition sequence for the restriction enzyme Ncol 
(marked with a line above the sequence) . Cleaved ampli- 
fied fragments can be ligated with the fragment that has 
15 been amplified with the aid of PL-C2 and then cleaved 

with Ncol. The fragment will therewith be translated to 
the correct reading frame, DNA-f ragments which have been 
amplified with the aid of CDC-C will contain two so- 
called stop condons (emphasized) which terminate trans- 
20 lation. The recognition sequence for the restriction 

enzyme Sail (marked with a line above the sequence) 
follows immediately afterwards, this sequence also being 
found in the expression vector pHD389 (see Figure 1) . 

25 Those sequences which code for the binding properties of 

protein L (B1-B5) and for protein G (CDC) respectively 
contain no internal recognition sequences for the re- 
striction enzymes Hpall, Sail or Ncol. 

30 Amplif if ication and cloning procedures 

PCR (Polymerase Chain Reaction) was carried out in 
accordance with a protocol described by Saiki et al. , 
1988; PCR was carried out in a Hybaid Intelligent Heat- 
35 ing-block (Teddington, UK) : 100 pi of the reaction 

mixture contained 50 mM KC1, 10 mM Tris-HCl, pH 8.3, 1.5 
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mM MgCl 2# 100 Mg/ml gelatine, 3 00 mM with respect to 
each of the deoxynucleotides (dATP, dCTP, dGTP, dTTP) , 
(Pharmacia) . In order to amplify sequences which code 
for the light-chain binding parts of protein L, there 
5 were added 20 pmol of each of the oligonucleotides PL-N 

and PL-C2, and 10 Ml of a DNA-solution which contained 
0.1 mg/ml of chromosomal DNA from Peptostr entococcus 
magnus, strain 312. By way of an alternative, 20 pmol 
were added to each of the oligonucleotide pairs CDC-N 
10 and CDC-C and 10 Ml of a DNA-solution which contained 

0.1 mg/ml of chromosomal DNA from a group C streptococ- 
cus strain ( Streptococcus eauisimilis ) called C40 
(U. Sjobring, L. Bjorck and W. Kastern. 1991. Strepto- 
coccal protein G: Gene structure and protein binding 
15 properties. J. Biol. Chem. 266: 399-405 or with Ncol and 

Sail (10 U//Ltg PCR-product) , (for CDC) at 37°C. The thus 
amplified and subsegently cleaved DNA-f ragments were 
then separated by electrophoresis in a 2% (weight by 
volume) agrose gel (NuSieve agarose, FMC Bioproducts) in 
20 a TAE-buffer (40 mM Tris, 20 mM aNa-cetate, 2 mM EDTA, 

pH 8.0). The resultant fragments, 930 bp (for Bl-4) 
and 390 bp (for CDC) were cut from the gel. The 
concentration of DNA in the thus separated gel pieces 
was estimated to be 0.05 mg/ml. The agarose pieces cut 
25 from the gel and containing the cleaved, amplified 

fragments (Bl-4 and CDC) were melted in a water bath at 
65°C, whereafter they were allowed to cool to 37 °C. 
10 Ml (0.5 M9) of this DNA were transferred to a semi- 
microtube (Sarstedt) , preheated to 37 °C, whereafter 1 Ml 
3 0 of the vector pHD3 89 which had been cleaved with Narl 

and Sail were added. 1 Ml 10 x ligase buffer (Promega) 
and 1 Ml T4 DNA-ligase (1 unit/MD were also added. The 
ligating reaction was permitted to take place at 37 °C 
for 6 hours. The cleaving and ligating conditions recom- 
3 5 mended by the producer of DNA-ligase and restriction 

enzymes (Promega) were followed in other respects. The 
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ligating reaction was then us d to transform E. coli, 
strain LE392, which had been made competent in accor- 
dance with the rubidium-chloride/calcium-dichloride 
method as described by Kushner (1978). Manipulation of 
5 DNA was effected in accordance with molecular biological 

standard methods (Sambrook et al., 1989). 

Expression system 

10 The vector pHD389 (see Figure 2) is a modified variant 

of the plasmid pHD313 (Dalboge et al. , 1989). The vector 
which was replicated in E. coli (contains origin of 
replication from plasmid pUC19) is constructed such that 
DNA-fragments which have been cloned in the cleaving 
15 site for NarX will be expressed immediately after, or 

downstream, of the signal peptide (21 amino acids) from 
the envelope protein ompA from E. coli. Translation will 
be initiated from the ATG-codon which codes for the 
first amino acid (methionine) in the signal peptide. The 
20 construction with an E . coli -individual signal sequence 

which precedes the desired peptide enables the translat- 
ed peptide to be transported to the periplasmic space in 
E. coli . This is beneficial since it reduces the risk of 
degradation of the desired product through the intracel- 
25 lular occurrent enzymes of E. coli . Furthermore, it is 

easier to purify peptides which have been exported to 
the periplasmatic space. Unique recognition sequences 
(multiple cloning sequences) for several other restric- 
tion enzymes, among them EcoRX, Sail and B amH I are 
3 0 present immediately downstream of the Marl cleaving 

site. An optimized so-called Shine-Dalgarno sequence 
(also called ribosomal binding site, RBS) is found seven 
nucleotides upstream of the ATG-codon in the signal 
sequence from ompA, this optimized Shbine-Dalgarno 
3 5 sequence binding to a complementary sequence in 16S rRNA 

in the ribosomes and in a manner to decide that th 
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translation is initiated in the correct place. The 
transcription of such DNA as that which is co-trans- 
cribed with the signal sequence for ompA is controlled 
by the P R -promotor from coliphage X. The vector also 
5 contains the gene for CI857 from coliphage X, the prod- 

uct of which regulates-down transcription from P R and 
the product of which is expressed constitutively . This 
cI8 5 7 -mediated down-regulation of transcription from P R 
is heat-sensitive. Transcription which is regulated, or 

10 controlled, from this promotor will be terminated with 

the aid of a so-called rho- independent transcription 
terminating sequence which is inserted in the vector 
immediately downstream of the multiple cloning site. The 
plasmid also carries the gene for 6-lactamase (from the 

15 plasmid pUC19) , the product of which permits ampicillin- 

selection of E. coli clones that have been transformed 
with the vector. 

Selection of protein LG-produced clones 

20 

The transformed bacteria are cultivated on culture 
plates with LB-medium which also contained ampicillin in 
a concentration of 100 /xg/ml. The bacteria were culti- 
vated overnight at 30 °C, whereafter they were trans- 

25 f erred to a cultivation cabinet (42 °C) and cultured for 

a further four (4) hours. The plates were stored in a 
refrigerator overnight. On the following day, the colo- 
nies were transferred to nitrocellulose filters. The 
filters and culture plates were marked, so that the 

30 transferred colonies could later be identified on the 

culture plate. The culture plates were again incubated 
overnight at 3 0 °C, so that rests of transferred bacteria 
colonies remaining on the plates could again grow. The 
plates were then stored in a refrigerator. The filter 

35 was incubated in 10% SDS for 10 minutes, so as to lyse 

the bacteria in the colonies on the nitrocellulose 
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impression. Filters containing lysed bacteria w re then 

rinsed with a blocking buffer consisting of PBS (pH 7.2) 

with 0.25% gelatine and 0.25% Tween-20 (four baths of 

250 ml at 37°C) , whereafter the filter was incubated 

125 

5 with radioactively (marked with I according to the 

chloromine-T-method) marked Ig-*c-chains (20 ng/ml) in 
PBS with 0.1% gelatine). The incubation process took 
place at room temperature for four (4) hours, whereafter 
non-bound radioactively marked protein was rinsed-off 

10 with PBS (pH 7.2) containing 0.5 M NaCl, 0.25% gelatine 

and 0.25% Tween-20 (four baths, 250 ml each at room tem- 
perature) . All filters were exposed to X-ray film. 
Positive colonies on the original culture plate were 
identified. A number of positive colonies were re- 

15 cultivated on new plates and new colony-blot experiments 

were carried out with these plates as a starting materi- 
al with the intention of identifying E . coli colonies 
which bind IgG Fc. These tests were carried out in 
precisely the same manner as that described above with 

20 respect to the identification of E. coli -colonies which 

expressed Ig light-chain-binding protein, with the 
exception that a radioactively marked ( I) IgG Fc (20 
ng/ml) was used as a probe. Clones which reacted with 
both proteins were selected and analyzed with regard to 

25 the size of the DNA-fragment introduced in the vector. 

One of these clones was chosen for production of protein 
LG, pHDLG. The DNA't taken from this clone and intro- 
duced into plasmid pHD389 was sequenced. The DNA-se- 
quence exhibited full agreement with corresponding 

30 sequences (B1-B4 and 21 bases in B5) in the gene for 

protein L from Peptostreptococcus maanus , strain 312, 
and with C1DC2 sequence in group C streptococcus strain 
C40. The size and binding properties of the protein 
produced from clone pHDLG was analyzed with the aid of 

35 SDS-PAGE (see Figure 8) , dot-blot experiment (see Figure 

10) and competitive binding experiments. 
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Production of pr tein LG 

Several colonies from a culture plate with E. coli pHDLG 
were used to inoculate a preculture (LB-medium with an 
5 addition of 100 mg/1 ampicillin) were cultivated at 28 °C 

overnight. In the morning, the preculture was trans- 
ferred to a larger volume (100 times the volume of the 
preculture) of fresh LB-medium containing ampicillin 
(100 mg/1) and was cultivated in vibrating flasks (200 

10 rpm) , (or fermenters) at 28 °C. When an absorbence value 

of 0.5 was reached at 620 nm f the cultivation tempera- 
ture was raised to 40 °C (induction of transcription) . 
The cultivation process was then continued for 4 hours 
(applies only to cultivation in vibrated flasks) . The 

15 bacteria were centrifuged down upon termination of the 

cultivation process. The bacteria were then lysed at 4°C 
in accordance with an osmotic shock method (Dalboge et 
al., T.989) . The lysate was adjusted to a pH of 7. Re- 
maining bacteria rests were centrifuged down and the 

20 supernatent then purified on IgG-sepharose, in accor- 

dance with the protocol earlier described with reference 
to protein G and protein L. (Sjobring et al. , 1991, 
Kastern et al., 1992). 

25 The expression system gave about 30 mg/1 of protein LG 

when cultivation in vibrated flasks. A deposition has 
been made at DSSM, Identification Reference DSSM E. coli 
LE3 92/ pHDLG • 

3 0 Example 3 

Analysis of the binding properties of protein LG 

Western Blot 

35 

Protein G (the ClDC2-f ragment) , protein L (four B- 



SUBSTITUTE SHEET 



WO 93/22342 



PCT/SE93/0037S 



31 

domains) and protein LG w re isolated with SDS-PAGE (10% 
acrylamide concentration) . The isolated proteins were 
transfered to nitrocellulose membranes in three similar 
copies (triplicate) . Each of these membranes was incu- 
5 bated with radioactively marked proteins (20 ng/ml: one 

of the membrane-copies was incubated with human poly- 
clonal IgG, another with human IgG Fc-fragment and the 
third with isolated human IgG xchains. Non-bound radio- 
actively marked proteins were rinsed off and all filters 
10 were then exposed to X-ray film. 

Slot-blot 

Human polyclonal Ig-preparations and Ig-fragments were 
15 applied with the aid of a slot-blot appliances on nitro- 

cellulose filters in given quantities (see Figure 10) on 
three similar copies. Each of these membranes was incu- 
bated with radioactively marked proteins (20 ng/ml). One 
of the membrane copies was incubated with protein LG, 
20 another with protein L and the third with protein G. 

Non-bound radioactively marked proteins were rinsed-of f 
and all filters were then exposed to X-ray film. 

The results are shown in Figures 9 and 10. 

25 

Other binding experiments have been carried out, with 
the following results: 
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Binding of the proteins G, L and LG to immunoglobulins. 

Binding protein: G K L K LG K 

a a 

Immunoglobulin 



Human: 

Polyclonal igG* + 67 (10) + 9-0 + 20 

IgG subclasses , 

+ 2.0 + + 

Ig G l +3.1 + + 

IgG 3 +6.1 + + 

IgG 4 +4.7 + + 

IgG fragment 

Fc* + 6.0 (0.5) - + 



F(ab') 2 * + °- 4 (°« 2 ) + 

kappa - + 
lambda - I ~ > 



1.5 



Other Ig-classes 
IgM 
IgA 
IgE 
IgD 



+ 11.6 + 
+ 10.4 + 

+ + 



+ 



Other Species ; 
Polyclonal 

Monkey + + 

Rabbit IgG +70 +0.074 + 

IgG-Fc +3.0 - + 

IgG-F(ab') 2 +0.44 + 

Mouse +41 +2.6 + 

Rat + 1.5 + 0.39 + 

Goat +14 - + 
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TABLE (cont'd.) 

Binding of the proteins G, L and LG to immunoglobulins. 

Binding protein: G K L K LG K 

a a 



Immunoglobulin 



Bovine IgG 1 +3 - + 

IgG 2 + 2 - + 

Horse + + 

Guinea Pig + + + 

Sheep + + 

Dog + + 

Pig + + + 

Hamster + 
Cat 
Hen 

Monclonals 
Mouse 

IgG^^ + + + 

IgG^ 

IgG 2b 

igc 3 + + 

IgM 
IgA 
Rat 

IgG 2b 
IgG 2c 



+ + 
+ + 

+ + 
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K = affinity constant (M* 1 ). *The numerals within parenthesis 
disclose the affinity of a recombinant protein G comprised of two 
IgG-binding domains- # A weak bond to lambda chains exists. 
& Binding to PI and PLG depends on the type of light chain of Ig. 

It will thus be seen that the synthesized hybrid protein LG has a 
broad binding activity/specificity. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: HighTech Receptro AB 

(B) STREET: c/o Active, Skeppsbron 2 

(C) CITY: MALMO 

( E ) COUNTRY : SWEDEN 

(F) POSTAL CODE (ZIP) : 211 20 

(G) TELEPHONE: 040/35 07 00 

(H) TELEFAX: 040/ 23 74 05 

(I) TELEX: 32637 Active S 

(ii) TITLE OF INVENTION: Hybridprotein 



(iii) NUMBER OF SEQUENCES: 1 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 



(EPO) 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: SE PCT/SE93/00375 
(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: SE 9201331-7 

(B) FILING DATE: 28-APR-1992 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 305 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli LE392/pHDL, DSM 7054 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Ala Val Glu Asn Lys Glu Glu Thr Pro Glu Thr Pro Glu Thr Asp Ser 
15 10 15 

Glu Glu Glu Val Thr lie Lys Ala Asn Leu lie Phe Ala Asn Gly Ser 
20 25 30 

Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu 
35 40 45 
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Ala Tyr Ala Tyr Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr 
50 55 60 

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn lie Lys Phe Ala Gly 
65 70 75 80 

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr lie Lys Ala 

85 90 95 

Asn Leu lie Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 
100 105 110 

Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu 
115 120 125 

Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr 
130 135 140 

Thr Leu Asn lie Lys Phe Ala Gly Lys Glu Lys Thr Pro Glu Glu Pro 
145 150 155 160 

Lys Glu Glu Val Thr lie Lys Ala Asn Leu lie Tyr Ala Asp Gly Lys 

165 170 175 

Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Glu Ala Thr Ala Glu 
180 185 190 

Ala Tyr Arg Tyr Ala Asp Leu Leu Ala Lys Glu Asn Gly Lys Tyr Thr 
195 200 205 

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn lie Lys Phe Ala Gly 
210 215 220 

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr lie Lys Ala 
225 230 235 240 

Asn Leu lie Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 

245 250 255 

Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu 
260 265 270 

Ala Lys Glu Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly Tyr 
275 280 285 

Thr lie Asn lie Arg Phe Ala Gly Lys Lys Val Asp Glu Lys Pro Glu 
290 295 300 

Glu 
305 

(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 921 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli LE392/pHDL, DSM 7054 

V 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



GCGGTAGAAA 


ATAAAGAAGA 


AACACCAGAA 


ACACCAGAAA 


CTGATTCAGA 


50 


AGAAGAAGTA 


ACAATCAAAG 


CTAACCTAAT 


CTTTGCAAAT 


GGAAGCACAC 


100 


AAACTGCAGA 


ATTCAAAGGA 


ACATTTGAAA 


AAG CAA CATC 


AGAAGCTTAT 


150 


GCGTATGCAG 


ATACTTTGAA 


GAAAGACAAT 


GGAGAATATA 


CTGTAGATGT 


200 


TGCAGATAAA 


GGTTATACTT 


TAAATATTAA 


ATTTGCTGGA 


AAAGAAAAAA 


250 


CACCAGAAGA 


ACCAAAAGAA 


GAAGTTACTA 


TTAAAGCAAA 


CTTAATCTAT 


300 


GCAGATGGAA 


AAACACAAAC 


AGCAGAATTC 


AAAGGAACAT 


TTGAAGAAGC 


350 


AACAGCAGAA 


GCATACAGAT 


ATGCAGATGC 


ATTAAAGAAG 


GACAATGGAG 


400 


AATATACAGT 


AGACGTTGCA 


GATAAAGGTT 


ATACTTTAAA 


TATTAAATTT 


450 


GCTGGAAAAG 


AAAAAACACC 


AGAAGAACCA 


AAAGAAGAAG 


TTACTATTAA 


500 


AGCAAACTTA 


ATCTATGCAG 


ATGGAAAAAC 


ACAAACAGCA 


GAATTCAAAG 


550 


GAACATTTGA 


AGAAGCAACA 


GCAGAAGCAT 


ACAGATATGC 


TGACTTATTA 


600 


GCAAAAGAAA 


ATGGTAAATA 


TACAGTAGAC 


GTTGCAGATA 


AAGGTTATAC 


650 


TTTAAATATT 


AAATTTGCTG 


GAAAAGAAAA 


AACACCAGAA 


GAACCAAAAG 


700 


AAGAAGTTAC 


TATTAAAGCA 


AACTTAATCT 


ATGCAGATGG 


AAAAACTCAA 


750 


ACAGCAGAGT 


TCAAAGGAAC 


ATTTGCAGAA 


GCAACAGCAG 


AAGCATACAG 


800 


ATACGCTGAC 


TT ATT AG CAA 


AAGAAAATGG 


TAAATATACA 


GCAGACTTAG 


850 


AAGATGGTGG 


ATACACTATT 


AATATTAGAT 


TTGCAGGTAA 


GAAAGTTGAC 


900 


GAAAAACCAG 


AAGAATAATA 


A 






921 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli LE392/pHDLG, DSM 7055 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ala Val Glu Asn Lys Glu Glu Thr Pro Glu Thr Pro Glu Thr Asp Ser 
15 10 15 

Glu Glu Glu Val Thr lie Lys Ala Asn Leu lie Phe Ala Asn Gly Ser 
20 25 30 

Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu 
35 40 45 

Ala Tyr Ala Tyr Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr 
50 55 60 

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn lie Lys Phe Ala Gly 
65 70 75 80 

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr lie Lys Ala 

85 90 95 

Asn Leu lie Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 
100 105 110 

Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu 
115 120 125 

Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr 
130 135 140 

Thr Leu Asn lie Lys Phe Ala Gly Lys Glu Lys Thr Pro Glu Glu Pro 
145 150 155 160 

Lys Glu Glu Val Thr lie Lys Ala Asn Leu lie Tyr Ala Asp Gly Lys 

165 170 175 

Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Glu Ala Thr Ala Glu 
180 185 190 

Ala Tyr Arg Tyr Ala Asp Leu Leu Ala Lys Glu Asn Gly Lys Tyr Thr 
195 200 205 

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn lie Lys Phe Ala Gly 
210 215 220 

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr lie Lys Ala 
225 230 235 240 

Asn Leu lie Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 

245 250 255 

Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu 
260 265 270 
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Ala Lys Glu Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly Tyr 
275 280 285 

Thr lie Asn lie Arg Ph Ala Gly Lys Lys Val Asp Glu Lys Pro Glu 
290 295 300 

Glu Pro Met Asp Thr Tyr Lys Leu lie Leu Asn Gly Lys Thr Leu Lys 
305 310 315 320 

Gly Glu Thr Thr Thr Glu Ala Val Asp Ala Ala Thr Ala Glu Lys Val 

325 330 335 

Phe Lys Gin Tyr Ala Asn Asp Asn Gly Val Asp Gly Glu Trp Thr Tyr 
340 345 350 

Asp Asp Ala Thr Lys Thr Phe Thr Val Thr Glu Lys Pro Glu Val lie 
355 360 365 

Asp Ala Ser Glu Leu Thr Pro Ala Val Thr Thr Tyr Lys Leu Val lie 
370 375 380 

Asn Gly Lys Thr Leu Lys Gly Glu Thr Thr Thr Lys Ala Val Asp Ala 
385 390 395 400 

Glu Thr Ala Glu Lys Ala Phe Lys Gin Tyr Ala Asn Asp Asn Gly Val 

405 410 415 

Asp Gly Val Trp Thr Tyr Asp Asp Ala Thr Lys Thr Phe Thr Val Thr 
420 425 430 

Glu Met 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 08 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichis coli L392/pHDLG, DSM 7055 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GCGGTAGAAA ATAAAGAAGA AACACCAGAA ACACCAGAAA CTGATTCAGA 50 

AGAAGAAGTA ACAATCAAAG CTAACCTAAT CTTTGCAAAT GGAAGCACAC 100 

AAACTGCAGA ATTCAAAGGA ACATTTGAAA AAGCAACATC AGAAGCTTAT 150 

GCGTATGCAG ATACTTTGAA GAAAGACAAT GGAGAATATA CTGTAGATGT 200 
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TGCAGATAAA GGTTATACTT TAAATATTAA 
CACCAGAAGA ACCAAAAGAA GAAGTTACTA 
GCAGATGGAA AAACACAAAC AGCAGAATTC 
AACAGCAGAA GCATACAGAT ATGCAGATGC 
AATATACAGT AGACGTTGCA GATAAAGGTT 
GCTGGAAAAG AAAAAACACC AGAAGAACCA 
AGCAAACTTA ATCTATGCAG ATGGAAAAAC 
GAACATTTGA AGAAGCAACA GCAGAAGCAT 
GCAAAAGAAA ATGGTAAATA TACAGTAGAC 
TTTAAATATT AAATTTGCTG GAAAAGAAAA 
AAGAAGTTAC TATTAAAGCA AACTTAATCT 
ACAGCAGAGT TCAAAGGAAC ATTTGCAGAA 
ATACGCTGAC TTATTAGCAA AAGAAAATGG 
AAGATGGTGG ATACACTATT AATATTAGAT 
GAAAAACCAG AAGAACCCAT GGACACTTAC 
AACATTGAAA GGCGAAACAA CTACTGAAGC 
AAAAAGTCTT CAAACAATAC GCTAACGACA 
ACTTACGACG ATGCGACTAA GACCTTTACA 
GATCGATGCG TCTGAATTAA CACCAGCCGT 
TTAATGGTAA AACATTGAAA GGCGAAACAA 
GAAACTGCAG AAAAAGCCTT CAAACAATAC 
TGGTGTTTGG ACTTATGATG ATGCGACTAA 
TGTAATAA 
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ATTTGCTGGA 


AAAGAAAAAA 


250 


TTAAAGCAAA 


CTTAATCTAT 


300 


AAAGGAACAT 


TTGAAGAAGC 


350 


ATTAAAGAAG 


GACAATGGAG 


400 


ATACTTTAAA 


TATTAAATTT 


450 


AAAGAAGAAG 


TTACTATTAA 


500 


ACAAAC AG C A 


GAATTCAAAG 


550 


ACAGATATGC 


TGACTTATTA 


600 


GTTGCAGATA 


AAGGTTATAC 


650 


AACACCAGAA 


GAACCAAAAG 


700 


ATGCAGATGG 


AAAAACTCAA 


750 


GCAACAGCAG 


AAGCATACAG 


800 


TAAATATACA 


GCAGACTTAG 


850 


TTGCAGGTAA 


GAAAGTTGAC 


900 


AAATTAATCC 


TTAATGGTAA 


950 


TGTTGATGCT 


GCTACTGCAG 


1000 


ACGGTGTTGA 


CGGTGAATGG 


1050 


GTTACTGAAA 


AACCAGAAGT 


1100 


GACAACTTAC 


AAACTTGTTA 


1150 


CTACTAAAGC 


AGTAGACGCA 


1200 


GCTAACGACA 


ACGGTGTTGA 


1250 


GACCTTTACG 


GTAACTGAAA 


1300 
1308 
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Claims 

1. Protein L having the ability to bind to the light 
chains of immunoglobulins, characterized 
in that the protein L has the following amino acid 
sequence : 



1 0 



1 5 



B1 



Ala Val Glu Asn 



s Glu Glu Thr Pro Glu Thr Pro Glu Thr Asp Ser 



10 



15 



Glu Glu Glu Val Thr lie Lys Ala Asn Leu He Phe Ala Asn Gly Ser 
20 25 30 

Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu 
35 40 *5 

Ala Tyr Ala Tyr Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr 
50 55 60 

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu i«n He Lys Phe Ala Gly 

75 80 



65 



32 



70 



iLy 



s Glu Lyo Thr Pro Glu Glu Pro Lys Glu Glu Val Thr He Lys Ala 



85 



90 



95 



Asn Leu He Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 
100 105 HO 

Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu 
115 120 125 

Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr 
130 135 b3 140 

Thr Leu Asn He Lys Phe Ala Gly; Lys Glu Lys Thr Pro Glu Glu Pro 

155 I 60 



145 



150 



Lys Glu Glu Val Thr lie Lys Ala Asn Leu He Tyr Ala Asp Gly Lys 

165 170 * 75 

Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Glu Ala Thr Ala Glu 
180 185 190 

Ala Tyr Arg Tyr Ala Asp Leu Leu Ala Lys Glu Asn Gly Lys Tyr Thr 
* 205 



195 



200 



Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn He Lys Phe Ala Gly 
710 215 * 20 
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B4 

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr He Lys Ala 

225 230 235 240 

Asn Leu lie Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 

5 245 250 255 

Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu 

260 265 270 



10 



Ala Lys Glu Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly Tyr 
275 280 B5 285 



Thr lie Asn lie Arg Phe Ala Gly! Lys Lys Val Asp Glu Lys Pro Glu 
290 295 3°° 



Glu 



15 



and variants, subf ragments , multiples or mixtures of the 
domains B1-B5 having the same binding properties. 

2. DNA-sequence , characterized in that 
20 it codes for the protein according to Claim 1 and has 

the following nucleotide sequence: 

GCG GTA GAA AAT AAA GAA GAA ACA CCA GAA ACA CCA GAA ACT GAT TCA 48 
25 GAA GAA GAA GTA ACA ATC AAA GCT AAC CTA ATC TTT GCA AAT GGA AGC 96 
ACA CAA ACT GCA GAA TTC AAA GGA ACA TTT GAA AAA GCA ACA TCA GAA 144 
GCT TAT GCG TAT GCA GAT ACT TTG AAG AAA GAC AAT GGA GAA TAT ACT 192 

240 

288 

AAC TTA ATC TAT GCA GAT GGA AAA ACA CAA ACA GCA GAA TTC AAA GGA 336 
ACA TTT GAA GAA GCA ACA GCA GAA GCA TAC AGA TAT GCA GAT GCA TTA 384 
35 AAG AAG GAC AAT GGA GAA TAT ACA GTA GAC GTT GCA GAT AAA C-GT TAT 432 



GTA GAT GTT GCA GAT AAA GGT TAT ACT TTA AAT ATT AAA TTT GCT GGA 

30 

AAA GAA AAA ACA CCA GAA GAA CCA AAA GAA GAA GTT ACT ATT AAA GCA 



ACT TTA AAT ATT AAA TTT GCT GGA AAA GAA AAA ACA CCA GAA GAA CCA 
AAA GAA GAA GTT ACT ATT AAA GCA AAC TTA ATC TAT GCA GAT GGA AAA 



480 
528 
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CAA 


ALA 


GCA 


L-AA 


1 i w 






ACA 


TTT 


GAA 


GAA 


GCA 


ACA 


GCA 


GAA 


576 




GCA 


I AC 


Ata A 


1 A 1 








TTA 


GCA 


AAA 


GAA 


AAT 


GGT 


AAA 


TAT 


ACA 


624 


5 


GTA 










2k A A 
n/in 


GGT 


TAT 


ACT 


TTA 


AAT 


ATT 


AAA 


TTT 


GCT 


GGA 


672 




AAA 


GAA 


AAA 


ACA 


CCA 


GAA 


GAA 


CLA 


AAA 






GTT 


ACT 


ATT 


AAA 


GCA 


720 




AAC 


TTA 


ATC 


TAT 


GCA 


GAT 


GGA 


AAA 


ACT 


CAA 


ACA 


GCA 


GAG 


TTC 


AAA 


GGA 


768 


10 


ACA 


TTT 


GCA 










GAA 


GCA 


TAC 


AGA 


TAC 


GCT 


GAC 


TTA 


TTA 


816 




GCA 


AAA 


GAA 


AAT 


GGT 


AAA 


TAT 


ACA 


GCA 


GAC 


TTA 


GAA 


GAT 


GGT 


GGA 


TAC 


864 




ACT 


ATT 


AAT 


ATT 


AGA 


TTT 


GCA 


GGT 


AAG 


AAA 


GTT 


GAC 


GAA 


AAA 


CCA 


GAA 


912 


15 


GAA 


TAATAA 
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3. 


A hybrid protein, 


c h 


a r 


a c 


t e 


r i 


z e 


d 


in 







that it includes one or more of the Bl-B5-domains ac- 
20 cording to Claim 1 which bind to the light chains in 

immunoglobulins of all classes, and domains which bind 
to heavy chains in immunoglobulin G. 

4. A hybrid protein according to Claim 3, char- 
25 acterized in that the domains which bind to 

heavy chains in immunoglobulin G are chosen from among 
the Cl- and C2 -domains in protein G or from among any 
other functionally similar proteins which bind to heavy 
chains in immunoglobulin G, and variants, subf ragments , 
30 multiples or mixtures thereof having the same binding 

properties. 

5. A hybrid protein according to Claim 4, char- 
acterized in that the hybrid protein has the 

35 following amino acid sequence: 
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10 



15 



.-.la Val C-iu Asn Lvs Glu Glu Thr Pro Glu Thr Pro Glu Thr Asp Ser 
1 *5 10 15 

Glu Glu Glu Val Thr lie Lys Ala Asn Leu He Phe Ala Asn Gly Ser 
20 25 30 

Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu 
35 40 45 

Ala Tyr Ala Tyr Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr 
50 55 60 

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn lie Lys Phe Ala Gly 
65 70 75 80 

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr He Lys Ala 

85 90 95 

Asn Leu He Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 
100 105 HO 

Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu 
115 120 125 

Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr 
130 135 140 

Thr Leu Asn He Lys Phe Ala Gly Lys Glu Lys Thr Pro Glu Glu Pro 
145 150 155 160 

Lys Glu Glu Val Thr He Lys Ala Asn Leu He Tyr Ala Asp Gly Lys 

165 170 175 

Thr Gin Thr Ala Glu Phe Lys Gly Thr Phe Glu Glu Ala Thr Ala Glu 
180 185 190 

Ala Tyr Arg Tyr Ala Asp Leu Leu Ala Lys Glu Asn Gly Lys Tyr Thr 
195 200 205 

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn He Lys Phe Ala Gly 
210 215 220 

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr He Lys Ala 
225 230 235 240 

35 Asn Leu He Tyr Ala Asp Gly Lys Thr Gin Thr Ala Glu Phe Lys Gly 

245 250 255 



20 



25 



30 
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Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu 
260 265 270 

Ala Lys Glu Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly Tyr 
5 275 280 285 

Thr He Asn He Arg Phe Ala Gly Lys Lys Val Asp Glu Lys Pro Glu 
250 295 300 

■31u ?ro Met Asp Thr Tyr Lys leu lie Leu Asr. Gly Lys Thr Leu Lys 
10 205 310 315 -° 

Gly Glu Thr Thr Thr Glu Ala Val Asp Ala Ala Thr Ala Glu Lys Val 

325 330 335 



15 



20 



Phe Lys Gin Tyr Ala Asn Asp Asn Gly Val Asp Gly Glu Trp Thr Tyr 
340 345 350 

Asp Asp Ala Thr Lys Thr Phe Thr Val Thr Glu Lys Pro Glu Val He 
355 360 365 

Asp Ala Ser Glu Leu Thr Pro Ala Val Thr Thr Tyr Lys Leu Val He 
370 375 380 

Asn Gly Lys Thr Leu Lys Gly Glu Thr Thr Thr Lys Ala Val Asp Ala 
385 390 395 400 

Glu Thr Ala Glu Lys Ala Phe Lys Gin Tyr Ala Asn Asp Asn Gly Val 

405 410 415 

Asp Gly Val Trp Thr Tyr Asp Asp Ala Thr Lys Thr Phe Thr Val Thr 
" 420 425 430 

GlU Met 



30 and variants, subf ragments , multiples or mixtures of the 

domains B1-B5 having the same binding properties. 

6. DNA-sequence , characterized in that 
it codes for a protein according to Claim 5 and has the 
35 following nucleotide sequence: 
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GCG GTA GAA AAT AAA 
GAA GAA GAA GTA ACA 
ACA CAA ACT GCA GAA 
GCT TAT GCG TAT GCA 

5 

GTA GAT GTT GCA GAT 
AAA GAA AAA ACA CCA 
AAC TTA ATC TAT GCA 
10 ACA TTT GAA GAA GCA 
AAG AAG GAC AAT GGA 
ACT TTA AAT ATT AAA 
AAA GAA GAA GTT ACT 

15 

ACA CAA ACA GCA GAA 
GCA TAC AGA TAT GCT 
GTA GAC GTT GCA GAT 
20 AAA GAA AAA ACA CCA 
AAC TTA ATC TAT GCA 
ACA TTT GCA GAA GCA 
GCA AAA GAA AAT GGT 

25 

ACT ATT AAT ATT AGA 
GAA CCC ATG GAC ACT 
GGC GAA ACA ACT ACT 
30 TTC AAA CAA TAC GCT 
GAC GAT GCG ACT AAG 
GAT GCG TCT GAA TTA 
AAT GGT AAA ACA TTG 

35 

GAA ACT GCA GAA AAA 
GAT GGT GTT TGG ACT 
GAA ATG TAATAA 



46 

GAA GAA ACA CCA GAA ACA 
ATC AAA GCT AAC CTA ATC 
TTC AAA GGA ACA TTT GAA 
GAT ACT TTG AAG AAA GAC 
AAA GGT TAT ACT TTA AAT 
GAA GAA CCA AAA GAA GAA 
GAT GGA AAA ACA CAA ACA 
ACA GCA GAA GCA TAC AGA 
GAA TAT ACA GTA GAC GTT 
TTT GCT GGA AAA GAA AAA 
ATT AAA GCA AAC TTA ATC 
TTC AAA GGA ACA TTT GAA 
GAC TTA TTA GCA AAA GAA 
AAA GGT TAT ACT TTA AAT 
GAA GAA CCA AAA GAA GAA 
GAT GGA AAA ACT CAA ACA 
ACA GCA GAA GCA TAC AGA 
AAA TAT ACA GCA GAC TTA 
TTT GCA GGT AAG AAA GTT 
TAC AAA TTA ATC CTT AAT 
GAA GCT GTT GAT GCT GCT 
AAC GAC AAC GGT GTT GAC 
ACC TTT ACA GTT ACT GAA 
ACA CCA CCC GTG ACA ACT 
AAA GGC GAA ACA ACT ACT 
CCC TTC AAA CAA TAC GCT 
TAT GAT GAT GCG ACT AAG 
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CCA GAA ACT GAT TCA ^8 

TTT GCA AAT GGA AGC 96 

AAA GCA ACA TCA GAA 1^4 

AAT GGA GAA TAT ACT '92 

ATT AAA TTT GCT GGA 240 

GTT ACT ATT AAA GCA 288 

GCA GAA TTC AAA GGA 336 

TAT GCA GAT GCA TTA 384 

GCA GAT AAA GGT TAT 432 

ACA CCA GAA GAA CCA 480 

TAT GCA GAT GGA AAA 523 

GAA GCA ACA GCA GAA 576 

AAT GGT AAA TAT ACA 624 

ATT AAA TTT GCT GGA 672 

GTT ACT ATT AAA GCA 720 
GCA GAG TTC AAA GGA 768 
TAC GCT GAC TTA TTA 616 
GAA GAT GGT GGA TAC 864 
GAC GAA AAA CCA GAA 912 
GGT AAA ACA TTG AAA 960 

ACT GCA GAA AAA GTC 1008 

GGT GAA TGG ACT TAC 1056 

AAA CCA GAA GTG ATC 1104 
TAC AAA CTT GTT ATT 1152 
AAA GCA GTA GAC GCA 1200 
AAC GAC AAC GGT GTT 1248 
ACC TTT ACG GTA ACT 1296 

1308 
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7 . DNA-sequence , characterized in that 
it codes for a protein according to Claim 3 and 4. 

8. DNA-sequence, characterized in that 
5 it hybridizes to the DNA-sequence of Claim 2, 6 or 7 

under conventional conditions and codes for a protein 
which has the same binding properties as the protein 
according to any one of Claims 1 and 3-5 . 

10 9. A plasmid vector, characterized in 

that it includes a DNA-sequence according to any one of 
Claims 2 and 6-8, preferably the vector pHDLG or pHDL. 

10. A host cell, characterized in that it 
15 is transformed with the hybrid plasmid according to 

Claim 9, in particular a host which belongs to the 
species E. coli . particularly E, coli LE392, or ^acillus 
subtilis. Saccaro mvces cerevisiae, preferably Id. Ref . 
DSSM E. coli LE392 pHDL and E. coli LE3 92/ pHDLG res- 
20 pectively. 

11. A method for producing a protein according to 
Claims 1 and 3-5, characterized by cultiv- 
ating a host cell according to Claim 10 under suitable 

25 conditions; accumulating the protein in the culture or 

lysing the cells and extracting the protein therefrom. 

12. A reagent apparatus for binding, separating and 
identifying immunoglobulins, characterized 

3 0 in that it includes a protein according to any one of 

Claims 1 and 3-5. 

13. A composition, characterized in that 
it includes a protein according to any one of Claims 1 

35 and 3-5, and optionally additives or carriers. 
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14. A pharmaceutical composition, charac- 
terized in that it includes a protein according 
to any one of Claims 1 and 3-5, and optionally a pharma 
ceutically acceptable carrier or extender. 
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